Web Metasearch as Belief Aggregation

Author

  • Sergio A. Alvarez
Abstract

Web metasearch requires a mechanism for combining rank-ordered lists of ratings returned by multiple search engines in response to a given user query. We view this as being analogous to the need for combining degrees of belief in probabilistic and uncertain reasoning in artificial intelligence. This paper describes a practical method for performing web metasearch based on a novel transformation-based theory of belief aggregation. The consensus ratings produced by this method take into account the item ratings/rankings output by individual search engines as well as the user's preferences.

Copyright © 2000, American Association for Artificial Intelligence (www.aaai.org). All rights reserved.

From: AAAI Technical Report WS-00-01. Compilation copyright © 2000, AAAI (www.aaai.org). All rights reserved.

Introduction

Web search engines (WSE) use tools ranging from simple text-based search to more sophisticated methods that attempt to understand the intended meanings of both queries and data items. There has been much work in this area in recent years. The link structure of the web has been used to understand the relationships between documents (Chakrabarti et al. 1999). Machine learning techniques have been applied to web search (McCallum et al. 1999), (Boyan, Freitag, & Joachims 1996). Specialized agents that mine the web have been described (Doorenbos, Etzioni, & Weld 1997). Work on human behavior sheds light on web search from a different perspective (Macskassy et al. 1998). Related problems include those of intelligently recommending scientific papers (Basu et al. 1999) and creating digital libraries for efficient indexing and retrieval of scientific documents (Lawrence, Bollacker, & Giles 1999). Reviews of work in web searching include (Lawrence & Giles 1999), (Filman & (guest editors) 1998), (Lawrence & Giles 1998).

We are interested in web metasearch engines (MSE) (Selberg & Etzioni 1995), (Glover et al. 1999), which dispatch user queries to several available WSE; each WSE produces an ordered list of data items in response to the query, and the MSE combines these lists into a single summary list that is then passed on to the user.

In the present paper we present a new approach to web metasearching. Numerical relevance ratings are provided as part of our method's output. A useful feature of our approach is that it allows the user to give subjective confidence values for the particular WSE being employed. These confidence values determine the relative importance accorded to the different WSE when producing the final search summary that the user receives as output.

Our approach is based on a framework (Alvarez 1997), (Alvarez 2000) that provides a set of tools with which to systematically construct combination operators for belief aggregation, each determined by a different choice of geometric transformation or reference frame in an abstract space (in the present case this space is the space of relevance ratings). Combination operators allow one to uniformly assign numerical relevance ratings to the items found by the WSE being polled by the system, and thus ultimately to produce the final summary list.

Our approach assumes that the WSE return numerical ratings in addition to rank-ordered lists of hits. If this is not the case, ratings may be assigned to rankings in some way before combination is performed. Algorithms for combining rankings when numerical ratings are not available have been studied previously, e.g. (Freund et al. 1998).

High flexibility and configurability are two properties that our approach inherits from the theoretical framework of (Alvarez 2000). Our framework provides a natural mechanism to vary the sensitivities of the resulting combination operators to their various inputs. This allows a system based on this approach to adapt according to the user's preferences.
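
As a rough illustration of the kind of combination described in the introduction, the following minimal sketch merges per-engine relevance ratings into consensus ratings weighted by user-supplied confidence values. It is not the operator defined in (Alvarez 2000), whose details are not given here. The sketch assumes ratings lie in (0, 1), assumes a log-odds transformation with a confidence-weighted mean taken in the transformed space, and assumes that an item missing from an engine's list receives a neutral rating of 0.5. The names `rank_to_rating`, `combine_ratings`, and `neutral` are hypothetical.

```python
import math
from typing import Dict

EPS = 1e-6  # keeps ratings strictly inside (0, 1) so the logit is finite


def logit(p: float) -> float:
    """Map a rating in (0, 1) to the real line (assumed transformation)."""
    p = min(max(p, EPS), 1.0 - EPS)
    return math.log(p / (1.0 - p))


def sigmoid(x: float) -> float:
    """Inverse of logit: map a real number back to a rating in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def rank_to_rating(rank: int, list_length: int) -> float:
    """Hypothetical way to assign a rating to a 1-based rank when an
    engine returns only an ordered list: linear decay inside (0, 1)."""
    return (list_length - rank + 1) / (list_length + 1)


def combine_ratings(
    ratings: Dict[str, Dict[str, float]],
    confidences: Dict[str, float],
    neutral: float = 0.5,
) -> Dict[str, float]:
    """Confidence-weighted mean of ratings, computed in log-odds space.

    ratings:     engine name -> {item -> rating in (0, 1)}
    confidences: engine name -> non-negative user confidence weight
    neutral:     assumed rating for items an engine did not return
    """
    total = sum(confidences[engine] for engine in ratings)
    if total <= 0:
        raise ValueError("at least one engine needs positive confidence")
    items = set()
    for rated in ratings.values():
        items.update(rated)
    consensus = {}
    for item in items:
        acc = 0.0
        for engine, rated in ratings.items():
            acc += confidences[engine] * logit(rated.get(item, neutral))
        consensus[item] = sigmoid(acc / total)
    return consensus
```

Sorting the returned dictionary by value in descending order gives a summary list. Raising an engine's confidence value pulls the consensus toward that engine's ratings, which is the user-facing behavior described above. Swapping `logit` and `sigmoid` for another invertible pair would give a different operator of the same general shape.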

Similar resources

Metasearch information fusion using linear programming

For a specific query, merging the results returned from multiple search engines, in the form of a metasearch aggregation, can provide significant improvement in the quality of relevant documents. This paper suggests a minimax linear programming (LP) formulation for fusion of multiple search engines' results. The paper proposes a weighting method to include the importance weights of the underlying...

Advanced Metasearch Engine Technology

Among the search tools currently on the Web, search engines are the most well known thanks to the popularity of major search engines such as Google and Yahoo!. While extremely successful, these major search engines do have serious limitations. This book introduces large-scale metasearch engine technology, which has the potential to overcome the limitations of the major search engines. Essential...

Search Result Merging and Ranking Strategies in Meta-Search Engines: A Survey

MetaSearch is the use of multiple other search systems to perform a simultaneous search. A MetaSearch Engine (MSE) is a search system that enables MetaSearch. To perform a MetaSearch, the user query is sent to multiple search engines; once the search results are returned, they are received by the MSE, merged into a single ranked list, and the ranked list is presented to the user. When a query is submi...

Text and Image Metasearch on the Web

As the Web continues to increase in size, the relative coverage of Web search engines is decreasing, and search tools that combine the results of multiple search engines are becoming more valuable. This paper provides details of the text and image metasearch functions of the Inquirus search engine developed at the NEC Research Institute. For text metasearch, we describe features including the u...

Effective rank aggregation for metasearching

Nowadays, mashup services and especially metasearch engines play an increasingly important role on the Web. Most users use them directly or indirectly to access and aggregate information from more than one data source. Similarly to the rest of the search systems, the effectiveness of a metasearch engine is mainly determined by the quality of the results it returns in response to user queries....

Publication date: 2000